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ABSTRACT 

We present a method to estimate distances to stars with spectroscopically derived stellar parameters. The 
technique is a Bayesian approach with likelihood estimated via comparison of measured parameters to a grid of 
stellar isochrones, and returns a posterior probability density function for each star’s absolute magnitude. This 
technique is tailored specifically to data from the Large Sky Area Multi-object Fiber Spectroscopic Telescope 
(LAMOST) survey. Because LAMOST obtains roughly 3000 stellar spectra simultaneously within each ~ 5°- 
diameter “plate” that is observed, we can use the stellar parameters of the observed stars to account for the 
stellar luminosity function and target selection effects. This removes biasing assumptions about the underlying 
populations, both due to predictions of the luminosity function from stellar evolution modeling, and from 
Galactic models of stellar populations along each line of sight. Using calibration data of stars with known 
distances and stellar parameters, we show that our method recovers distances for most stars within ~ 20%, but 
with some systematic overestimation of distances to halo giants. We apply our code to the LAMOST database, 
and show that the current precision of LAMOST stellar parameters permits measurements of distances with 
^ 40% error bars. This precision should improve as the LAMOST data pipelines continue to be refined. 

Subject headings: Galaxy: stellar content. Galaxy: structure, stars: distances, surveys (LAMOST) 


1. INTRODUCTION 

The Large Sky Area Multi -object Fiber Spectroscopic 
Teles cope (LAM OST J survey (iCui et al.l 120121 iDeng et aH 
120121 iLuo et alll2012l Izhao et al.ll2012h has thus far obtained 
medium-resolution (R ~ 2000) spectra for over 3 million 
stars, on its way to a goal of acquiring some 6-8 million stellar 
spectra over a planned 5-year survey. Such a vast survey pro¬ 
vides an invaluable resource for studies of Milky Way stellar 
dynamics. LAMOST data have been us ed to explore kine - 
matical asymmetries in the near by disk dCarlin et al.l 1201 3h. 
nearb y stellar movin g groups jz lao et S.l 120141 IXia et al.l 
l2015h, high-velocity (IZhong et all 2014t) and hyper-yelocity 
stars (IZheng et al.l 120141), and stel lar kinematics in the so¬ 
lar neighborhood (ITian et al.ll2015h . More distant halo struc¬ 
ture is also accessible yia red giant branch (RGB) stars ob- 
seryed by LAMOST. Howeyer, RGB stars can be difficult to 
identify among the much more numerous foreground dwarfs. 
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iLiu. C.. et al.l (1201 4a^ developed a method to select K-type gi¬ 
ants from their LAMOST spectra, and a technique to identify 
M giants (which are not processed in the main L AMOST stel¬ 
lar par ameters pipeline) has been developed by IZhong et al.l 
JIOll : see also the study of the Sagittarius tidal stream M 
giants by Li et al., in preparation). All of these studies re¬ 
quire an estimate of the distances to the stars involved in order 
to place them within the structure of the Galaxy. To use the 
LAMOST data to understand the dynamics of the Milky Way, 
it is thus vitally important to devise a method for determining 
stellar distances. 

One of the main outputs of a survey such as LAMOST 
is the spectroscopically derived line of sight (radial) veloc¬ 
ity (RV) for each star. In order to exploit this velocity in¬ 
formation as a probe of Galactic dynamics, one must know 
the three-dimensional position of each star within the Galaxy. 
The position on the sky is, of course, well known, but estimat¬ 
ing the third dimension of each star’s position (its distance) is 
non-trivial. Furthermore, even if reliable proper motions are 
known for each star, combining these with the RVs to derive a 
three-dimensional space motion for each star requires know¬ 
ing its distance. A survey such as LAMOST inevitably con¬ 
tains numerous nearby (mostly disk) dwarfs, with a smaller 
fraction of intrinsica lly bright RGB stars (e.g., K giants; see 
ILiu, C.. et al.ll201^ that can be used to study more distant 
structures in the halo of the Milky Way. Thus, in order to 
fully exploit LAMOST data for studies of Galactic structure, 
we require not only reliable estimates of stellar parameters, 
but a robust estimate of the distance to each star, based on 
available photometry and information gleaned from its spec¬ 
trum. 

For each star whose spectrum has suffi cient signal-to- 
noise (S/N), the LAMOS T pipe line (see, e.g.. lWu et al.ll20lTt 
iLuo et al1l2012t IWu et al.ll2014h derives effective temperature 
(Teff), surface gravity (logg), and metallicity ([Fe/H]). When 
these stellar parameters are known for a single star, it is typical 
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to compare the measured parameters to theoretical isochrones, 
and use the best theoretical match to estimate the absolute 
magnitude of the star in question. This is fairly straightfor¬ 
ward for a single star, but to robustly determine distances by 
this technique for large numbers of stars in an automated way 
presents a challenge. 

Recently, a variety of techniques have been presented 
for deriving distances to large numbers of stars with stel¬ 
lar parameters resulting from spectro scopic surveys such as 
RAdial Velocity Exper iment (RAVE; iStdnmetz et ah 1200^ 
IKordopatis etal]|2^ . SDSS/SEGUE (lYannv et al.l I20Q9D . 
and SDSS/APOGEE. These include a minimization rou¬ 
tine that c ompares RAVE stell ar parameters to a grid of 
isochrones (iBreddels et al.ll2010l) . and a modification of this 
technique (IZwitter et al.ll2010h to account for the stellar lu¬ 
minosity function of the isochrones. Bayesian methods that 
include models of Galactic stellar pop ulations as priors (e.g., 
[Burnett et al.l201 itiBinnev et al.l201^ have also been applied 
to RAVE data. Distances presented by the SEGUE Stellar Pa¬ 
rameters Pipelin e fSSPP: | Lee et al.ll2008h are based on em¬ 
pirical fits from iBeers et al.l (l2000l) to globular cluster fidu¬ 
cial sequences, and require separate calibrations for stars in 
different evolutionary stages. An alternate method that has 
been applied to SEGUE halo K-giants uses a Bayesian ap¬ 
proach to ac count for the lum inosity function and metallicity 
distribution (IXue et al.ll2014l) . Distances for stars observed 
by APOGEE have thus far b een limited to wel l-characterized 
stars such as red-clump stars (iBovv et alJ2014ft and red giants 
in the Kepler field tha t have asteroseismic surface gravities 
(iRodrigues et al.ir2014l) . 

Our method of deriving distances from LAMOST spectro¬ 
scopic parameters was chosen to avoid introducing assump¬ 
tions about stellar populations and their distribution in the 
Galaxy. We simply want the best empirical estimate of the 
distance, so that we can use this to explore the distribution of 
stellar populations in the Milky Way. The distribution of ob¬ 
served stellar parameters will be biased by the method used to 
select targets (e.g., the color and magnitude selection criteria), 
the intrinsic properties (including the stellar parameters and 
luminosity function) of the Galactic sub-populations sampled 
in each observed region, and perhaps even the observing con¬ 
ditions under which each spectrum was obtained. One benefit 
of LAMOST is that each star is observed as part of a “plate” 
on which ^ 3000 stars are simultaneously targeted. Each plate 
covers a narrow magnitu de range and has a simple target se¬ 
lectio n function (see, e.g.. lCarlin et al.ll2012HLiu. X-W„ et'an 
120141) . making it possible for us to use the observations them¬ 
selves to account for both the selection function and the stellar 
luminosity function in our distance estimates. In practice, the 
selection function has not remained the same throughout the 
LAMOST survey, making it impossible to back out the prob¬ 
ability of observing a star with given properties explicitly. We 
account for the effects of target selection by using the em¬ 
pirically measured distribution of stellar parameters on each 
plate as a prior for the likelihood of finding a star of a given 
surface gravity at a given color and magnitude. In this way, 
we are explicitly including the observed logg for all stars on 
a plate to derive an estimate of our expectations along each 
line of sight, thus removing selection biases and the effects of 
differently sampled Milky Way populations along each line of 
sight. 

This paper is outlined as follows. Section |2] discusses the 
techniques we have developed for deriving distances to stars 


with LAMOST stellar parameters. In Section|3]we verify the 
effectiveness of our technique using several catalogs from the 
literature, as well as simulated data sets. We follow with a 
brief illustration of the results from applying our algorithms 
to the entire LAMOST data set of ~ 1.8 million stellar spectra 
in Section |4] Einally, we conclude with some remarks about 
the utility of this method for Galactic structure science with 
LAMOST. 

2. DISTANCE DETERMINATION METHODS 

We derive distances to stars by comparing measured stellar 
parameters to a grid of synthetic isochrones. Initially, a simple 
method is tested. Though the results from this algorithm 
seem reasonable, it is difficult to derive reasonable uncertain¬ 
ties. We thus turn to a Bayesian technique, which also has the 
advantage of allowing the priors to be easily adjusted in the 
future as desired. 

2.1. Adopted Isochrones 

We began by creating a grid of i sochrones from th e Dart¬ 
mouth Stellar Evolution Database (iDotter et al.ll2008h . This 
particular set of isochrones was chosen in part because it 
more accurately reproduces the lower main sequence in 
SPSS colors than other systems (lEeiden & Chabov^ 120121 
lEeiden et al.ll20l4) . and in part simply for the convenience 
with which one can generate a custom grid of isochrones in 
the Dartmo uth systemF^ We tes ted our programs with Padova 
isochrones (iGirardi et al.l 12003) . and found that the differ¬ 
ences in derived distances are less than a few percent, with 
most of the discrepancies at the cooler end of the main se¬ 
quence. Our adopted grid contains isochrones ranging from 
-2.5 < [Ee/H] < -1-0.5 in 0.1 dex increments, and 1-15 Gyr in 
linearly spaced 1 Gyr increments. All isochrones were gener¬ 
ated with [a/Ee] = 0.0. Isochrone grids were generated for the 
SDSS ugriz and the UBVRIJHKsKp photometric bands, in or¬ 
der to allow for the use of a variety of input magnitudes to de¬ 
rive distances. We removed low-mass stars (M < O.4M0) and 
all evolutionary stages other than main sequence, subgiant, 
and RGB. Other evolutionary stages (e.g., horizontal branch) 
are not well classified at present by LAMOST spectra, and 
are also not well represented in the isochrones, so we chose to 
excise them and keep only “normal,” well-behaved stars. 

Before using the grid of isochrones for derivation of dis¬ 
tances, it was interpolated onto a regularly spaced distribu¬ 
tion in absolute magnitude. Because we need an absolute 
magnitude that corresponds to each metallicity, age, surface 
gravity, and temperature in the grid, we begin by creating 
a dummy array of absolute magnitudes spanning the rele¬ 
vant range (8 > Mks > -8 for 2MASS colors/magnitudes, 
and 12 > Mr > -4 for SDSS), in increments of 0.02 magni¬ 
tudes. Eor each combination of the 15 age steps and 31 steps 
in [Ee/H] making up our grid, we then use a cubic spline in¬ 
terpolation to map the effective temperature (Teff) and surface 
gravity (logg) behavior as a function of absolute magnitude. 
In this way, we create a grid with identical values of absolute 
magnitude for each age/metallicity combination, which then 
simply reflects the Teff and logg of a theoretical star at that 
age/metallicity that would have each value of absolute magni¬ 
tude in the array. In other words, at each absolute magnitude, 

' * Unlike most available isochrone system s, the Dartmouth isochrones in¬ 
clude atomic diffusion fsee lDotter et ali2008l and references therein), and are 
thus more reliable for age estimates. While measuring stellar ages is not a 
current concern of ours, we may wish to use the grid for this purpose in the 
future. 
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we create arrays with all combinations of age, [Fe/H], temper¬ 
ature, and surface gravity that are predicted by the isochrone 
grid. 


2.2. Chi-squared Technique 

The goal is to take the measured stellar parameters Teff, 
logg, and [Fe/H], along with known photometric magnitudes 
and colors, and deriv e a distance to each s tar. We employ 
near-infrared 2MASS (iSkrutskie et al.ll200^ magnitudes and 
colors from here onward, but these can be replaced with 
magnitudes from any other system (e.g., SDSS) for which 
the Dartmouth isochrones have been calculated. We chose 
2MASS because ~ 97% of the objects in the LAMOST cata¬ 
log have matches in 2MASS, thus providing a uniform input 
catalog (note that the majority of the objects that do not have 
2MASS counterparts are at the faintest magnitudes reached by 
LAMOST). The use of 2MASS also simplifies comparisons 
to other catalogs that may not overlap the SDSS footprint or 
magnitude range. 

Assuming we have measured input parameters (“observ¬ 
ables”) Teff, logg, [Fe/H], and (J-Ks) (or any other input 
color), and associated errors crFeff, criog(g), crpe/H] > and (J(j-Ks), 
we can define a statistic: 


X =2.—-2-> (1) 

,=1 

where the (9, are the observables (n = 4 in this example) with 
associated errors ug,, and (9; mod are the isochrone model pa¬ 
rameters corresponding to each observable. To determine the 
best value for our data, we find the model point at which x^ 
is a minimum. The distance modulus then simply consists of 
the difference between the model star’s absolute magnitude at 
this best-fit point and the input (measured) magnitude. 

To account for errors on the observables, while also deriv¬ 
ing an uncertainty on our derived distance, we resample N 
times (we adopted A=100 per star after confirming that this 
relatively small number of samples produces nearly identi¬ 
cal results as A = 1000 per star, while keeping computation 
times reasonable) from within Gaussian-distributed errors on 
each of the parameters, then minimize x^ for each sample. 
The mean and standard deviation of the probability distribu¬ 
tion function (PDF) for the absolute Ks magnitude {Mk) from 
this Monte Carlo process are measured for each star. We then 
combine these with the observed magnitude to derive the 
distance and its error. At this point, the uncertainties on many 
of the distances were found to be unrealistically high (often 
greater than 100% for stars from LAMOST); this is likely be¬ 
cause the errors on the stellar parameters quot ed in the LAM- 
OST catalog are overestimated (as found by iLee et al.1120151 
via comparison to SDSS spectra of stars in common between 
the surveys). Although the grid spacing of 0.1 dex in [Fe/H] 
is half the smallest metallicity uncertainty we have considered 
(tT[Fe/H] = 0.2), it may also be that finer grid spacing (in both 
age and metallicity) would reduce the scatter in Monte Carlo- 
resampled distance estimates. Furthermore, though we tested 
resampling A = 100 and 1000 times and found little effect on 
the derived distances and their errors, it is also possible that 
even larger samples are required to fully reproduce the cor¬ 
rect PDF of Mk for each star; this was not explored further 
because of the computationally prohibitive cost of resampling 
10,000 or more times per star. 


2.3. Bayesian Technique 

In order to obtain a posterior PDF for the distance rather 
than a simple distance estimate with associated error bar, we 
adopt a Bayesian method. This more readily allows for statis¬ 
tical studies of Galactic populations that require the full PDF. 
We choose to keep th e priors in our method extremely simple , 
unlike methods (e.g., iBurnett et al.ll201 ll iBinnev et al.ll2014l) 
that consider priors based on models of Milky Way stellar 
populations. Because we intend to use the derived distances to 
study the kinematics and density distributions of Milky Way 
stellar populations and their metallicity distribution functions, 
we wish to avoid priors that assume any uncertain properties 
of these populations. It is simple to incorporate more complex 
priors into our algorithm in the future, should we wish to do 
so. 

Consider a vector of observed stellar parameters O = 
(Teffjlogg, [Fe/H]) = (TjGjZ) with measurement errors cto- 
Assume that these can be mapped via stellar models 
onto a vector of intrinsic properties X = (age, mass, 
metallicity)= (A,Mo,Zn) that together determine the evolu¬ 
tion of each starPl These intrinsic properties combined with 
the stellar models give the absolute magnitude distribution 
Mabs(A,Mo,Zo). The mapping from observables O to Mabs 
represents a convolution of the intrinsic luminosity function 
and the ways in which the selection function has sampled this 
luminosity function. We will denote the selection function 
as S. Stellar models relate A to O; we do not include any 
explicit dependence of model parameters on A because we do 
not know anything about the ages of observed stars in advance 
(i.e., we take a uniform prior p{A) =1). Given a set of stellar 
models, observed stellar parameters, and the selection func¬ 
tion, a PDF for absolute magnitude can be derived. The full 
PDfis: 


p(Mab.s,0,tTo,5) = p{M^hs\0,ao,S)p(0,ao,S) (2) 

This can be rewritten as: 


p(Mabs ,0,(7o,S) = piM^bs\0,ao,S)piO,ao,S) 

=/2(0 |Mabs, CTo)p(cro |Mab.s)p(Mabs 15)p(5) , 

(3) 

where in the first term on the right-hand side, we have re¬ 
moved the dependence on S, because once the star has been 
observed, the likelihood p(0|Mabs,cro) no longer depends on 
the selection function. Rearranging, we obtain: 


p(Mabs|0,o-o,5) 


p{0 |Mabs, <Jo)p(ao |Mabs)p(Mabs | S)p{S) 
p{0,ao,S) 


(4) 

We assume that the measured errors cto are independent of 
Mabs. This may not strictly be true - the errors may indirectly 
depend on intrinsic properties of the star (e.g., low-metallicity 
stars may have larger az, or giants near the RGB tip may 
have higher ac), which in turn affect the Mabs. However, 
this should be a minimal effect for our purposes, so we take 
p((To|Mabs) to be independent of Mabs (i.e., this term becomes 


The Dartmouth isochrones are given explicitly in [Fe/H] rather than Z. 
However, the values of Z and ¥ (the helium fraction; calculated as 7 = 0.245 + 
1.54Z) are given in the header of ea ch isochrone, with d etails about the initial 
compositions of the models given in IDotter et al.ll200ll . Thus, if desired, one 
can use the stand ard definition of [Fe/H]= l og(Z/X)/(Z/X)(b, with standard 
solar values from IGrevesse & Sauvall 1199811 of (Z/X )0 = 0.0267 (initial) and 
(Z/Z)q = 0.0229 at the current solar age, to find Z. 
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Fig. 1.— Examples of the posterior PDF values from the Bayesian distance 
code. Each panel shows the likelihood distribution deriv ed from the stellar 
parameters for a randomly chosen star from the Besangon IRobin et al.l20Q3n 
model catalog. Red squares represent the 15%, 50%, and 85% values in the 
cumulative PDF, which we take as the best estimate (50%) and error bars 
for the absolute magnitude. Dashed blue lines in each panel show the “true” 
absolute magnitude given by the Besangon model. 


pio’o))- We also neglect the denominator, p(0,ao,S), which 
contributes only a normalization factor. The term p(M^\,ji\S) is 
the absolute magnitude distribution given the selection func¬ 
tion. If there is no selection function, this term would be the 
luminosity function. This leaves us with a final expression for 
the posterior PDF of 

P(Mabs|0,Cro,5) OC p(0|Mabs,Cro)p(Mabs|5)p((To)p(‘S) (5) 

We take the likelihood p(0|Mabs, (Zo) to be Gaussian in each 
of T,G, and Z, i.e., 

3 

p(0|Mabs,cro) OC ]^exp[-(6),-6)niod)V2o'o,/], (6) 

i'=l 

where i= 1 - 3 to indicate the inclusion of T, G, and Z in the 
product, and (9mod indicates the corresponding parameters for 
the model isochrone grid (and thus implicitly Mabs). In prac¬ 
tice, this is accomplished by calculating this Gaussian residual 
for the input star relative to every point in the model isochrone 
grid. Because every point on the grid with its parameters Omod 
has an associated Mabs, the likelihood given by this product of 
Gaussians can be mapped to a likelihood distribution in Mabs. 

The term p(Mabs|5) in Equation|5]encodes the relative num¬ 


bers of stars as a function of Mabs, given the selection func¬ 
tion S near each star’s line of sight; i.e., how likely a star of 
a certain Mabs is to have made it into the sample given the 
catalog of observed stars. This term is necessary because the 
entire luminosity function is not sampled by a given region 
of color-magnitude space. To account for this selection ef¬ 
fect (which depends on color and magnitude, but more im¬ 
portantly on position in the sky), we derive an empirical cor¬ 
rection based on the stars actually observed in a given LAM- 
OST plate. To do so, we select stars from the same LAMOST 
plate that are within 0.25 magnitudes in color and magnitude 
(e.g., Ks,o and {J-Ks)o, or whatever color-magnitude system 
is being used to derive distances) of the star of interest. For 
each nearby star, we generate a Gaussian centered at its mea¬ 
sured logg with width equal to its associated error, triogg, and 
then normalize its sum to unity to create a PDF. We create a 
generalized histogram by summing these PDFs for all of the 
color-magnitude selected stars in the plate, then normalize the 
resulting logg distribution to yield the probability of finding a 
star at a given logg value in the vicinity (in both position and 
color-magnitude) of the star of interest. This logg distribution 
is then mapped onto each input isochrone via interpolation of 
the log g-Mabs relation of the isochrone itself. The resulting 
histogram in absolute magnitude is normalized to provide the 
probability of finding stars of a given Mabs based on the mea¬ 
sured logg distribution. We incorporate this probability distri¬ 
bution as p(Mabs|5) in Equation|5]to properly account for the 
underlying luminosity function along each line of sight, and 
the selection effects of the survey (which corrects for both the 
fraction of stars that were selected as a function of color and 
magnitude as well as the volume of the Galaxy sampled by 
the selected stars). 

In the absence of a selection function, p(Mabs|5) could 
be represented by the theoretical luminosity function of the 
isochrones. Eor general usage, we include this feature in the 
code for instances where the parameters of nearby stars are 
not known. In the Dartmouth isochrones, the density of points 
along each isochrone as g iven encodes equal steps in “equiva¬ 
lent evolutionary phase” (iBertelli et alJll990l) . Assuming that 
this roughly mimics the luminosity function, we calculate the 
normalized density of points as a function of absolute magni¬ 
tude for each isochrone in histogram bins of 0.2 magnitudes, 
and use this as the starting estimate for p(Mab.s|5) when there 
are not sufficient nearby, simultaneously observed stars to use 
for the selection function correction. In tests of the effect of 
the selection function correction using 20,000 stars, we found 
that the fractional change in distance between measurements 
with/without this correction was less than 10% for 93% of the 
stars, and less than 20% for 97% of the stars. However, a 
small number of stars ( 1.4%; mostly red giants) had their dis¬ 
tances change by more than 30% between these two methods. 
As expected, then, the correction is important for the much 
rarer RGB stars than for the ubiquitous nearby dwarfs in the 
LAMOST database. 

The algorithm to evaluate each of the three terms on the 
right side of Equation |5] produces an array of points corre¬ 
sponding to all of the theoretical stars in the isochrone grid. 
Each of these points has an associated Mabs and a likelihood 
value. Thus Equation |5] also produces an array of posterior 
PDEs for a large grid of absolute magnitudes. We sum the 
PDE for each Mabs value to produce a marginalized PDE in 
Mabs for the input star. This is normalized to produce the fi¬ 
nal PDE p(Mabs|0 ,(To, 5); some examples are shown in Eig- 
ure[T] We take the median (i.e., 50th percentile) of this PDE 
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{^LAMOST"^Besancon)^^Besancon tl0gsan(;on (kpC) 

Fig. 2.— Comparison of distances derived by our two algorithms for Besan^on model galaxy stars in a simulated field at (l,b) = (180°,60°). The left panel 
shows the residuals from comparison of the Besan9on model distances, dBesancon, to distances derived from the model stellar parameters using our code, ^LAMOST- 
The solid black line is the output from the algorithm, and the dashed gray histogram is from the Bayesian code. The results from the Bayesian method have 
slightly larger scatter than those from the algorithm, but little systematic offset is seen in either method. The right panel directly compares the input (Besan 9 on) 
distances to those recovered by the code (^LAMOST)- Open symbols are distances, and filled symbols are results from the Bayesian code. The agreement is 
good out to d ~ 15 kpc. Beyond this distance, our algorithms systematically overestimate the distances by ~ 20% for the majority of stars. We believe this 
discrepancy is due to our use of isochrones with solar [a /Fe]; because metal-poor halo giants are typically alpha-enhanced relative to Milky Way disk stars, a 
more appropriate alpha-enhanced isochrone grid may remedy this deficiency for halo stars (see Section 4.2 and Figure|2]for further exploration of this effect). 


as the best estimate forMabs, with uncertainties derived using 
the Mabs corresponding to the 15th and 85th percentile values 
from the cumulative PDF. We also retain the full PDFs so that 
they can be used instead of single estimates of distances and 
their errors. 

Figure |2] shows a comparison of the results from running 
the two versions (x^ and Bayesian) of the code o n stars from 
a moc k galajg_ field generated by the Besan^on (iRobin et alJ 
l2003h modelOj Uncertainties on the stellar parameters for all 
stars in the mock catalog were set to CTeff = 100 K, criog g = 
0.3 dex, and cr[pe/H] = 0-2 dex. The left panel compares the 

residuals (in the sense (£/LAMOST-t/Besancon)/<iBe.sancon) of mea¬ 
sured distances to the input (model) distances. Both methods 
recover the input distances fairly well, with an asymmetric tail 
to high (overestimated) residuals. The right panel of Figure|2] 
compares these distance measurements directly. It is clear 
from this panel that the large positive residuals are mostly for 
distant, metal-poor giants, whose distances are overestimated 
by ^ 20%. Having now verified that the and Bayesian 
methods produce similar results, we henceforth use only the 
Bayesian method to obtain the full distance PDF. 

3. VERIFICATION OF THE METHODS 

To test the code, we need a sample of stars with known 
spectroscopic parameter s and distances. Eor this, we use the 
iGrav et aTT (I2003L 120061) measurements of stellar parameters 
for Hipparcos stars within 40 pc of the Sun as part of the 
NStars program. These nearby stars have good-quality paral¬ 
laxes ((T^/tt < 0.05), and are all found in the 2MASS catalog. 
Uncertainties on the stellar parameters for individual stars are 
not provided in the catalog; we choose to set them to crxeff = 

Throughout this work, we use the notation i^lamost to refer to distances 
derived by the algorithm discussed in this paper. Note that this does not 
mean that these distances are derived using LAMOST data, but rather that 
our LAMOST distance code is being applied. 



{^LAMOST"^Hipparcos)'^^Hipparcos 


Fig. 3.— Results of running the distance code on 1199 stars from the 
IGrav et ^ <20031120Q6J) catalogs. The histogram shows fractional residuals 
from comparison of the Hipparcos parallax distances, dnipparcos-, to distances 
derived from the Gray et al. stellai' parameters, <iLAMOST- For reference, the 
dashed vertical line is one-to-one agreement. Our distances show no system¬ 
atic offset with respect to the parallax distances, with ~ 17% scatter. 


100 K, fjiog g = 0.3 dex, and (T[pe/H] = 0-2 dex. Furthermore, we 
cannot rely on the method used for LAMOST data, where we 
selected stars from the same observed plate to derive the se¬ 
lection and luminosity functions. For this and all subsequent 
tests of our code, we obtain the underlying luminosity distri¬ 
bution using only the color-magnitude selection from the al¬ 
gorithm outlined in Section 2.3, which we apply to all stars in 
the test catalog without regard to position on the sky. Figure[3] 
shows the results of running our distance code using the Gray 
et al. stellar parameters plus 2MASS magnitudes and colors 
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Fig. 4.— Left panel: results of running the distance code on ~ 16,000 stars from RAVE in the IBreddels et all 1201011 catalogs. As in Figurethe panels 
represent the fraction al residuals relative to the catalog values from the Bayesian method. Our distances a re systematica l ly shif ted by ~ 26% relative to those of 
IBreddels et al.l <2010h . with ~ 23% scatter. Right panel: as in the left panel, but for the RAVE sample from IZwitter et all 1201011 . Our distances are systematically 
smaller by ~ 12%, with ~ 16% spread about this value. There are decidedly non-Gaussian tails in these residuals on toward large relative underestimates. 




Fig. 5.— Results from running the distance code on two realizations of the Besanjon Galaxy model. The left panel is for a field at / = 180°,h = 30°, and the 
right panel shows a field at / = 180°, b = 60°. 


for stars with temperature in the range 3500 K< Teff < 9000 K. 
This temperature cut reduces the sample from a total of 1525 
stars to 1199 used for Figure [3 The histogram compares the 
known distances, tfuipparcos (from the trigonometric parallaxes) 
to the derived distances, £/lamost, expressed as a residual: 
(<iLAMOST-<iHipparcos)/c/Hipparco.s- The residuals are centered on 
zero (i.e., no systematic offset is present), with a scatter of 
^17% (median absolute deviation of the data; we use this in¬ 
stead of fitting a Gaussian because the residuals are obviously 
asymmetric and non-Gaussian). Because of their location in 
the Solar neighborhood, the majority of these stars are metal- 
rich main-sequence stars (1151 are dwarfs with logg < 3.5, 
and 48 are giants). Thus, while these stars are a useful test, 
they do not explore the variety of stellar populations we ex¬ 
pect to find in a survey such as LAMOST. 

To examine our algorithm’s behavior on a more hetero¬ 
geneous data set, we test our code on two samples of 
RAVE DR2 stars. The first sample is the RAVE-6D cat¬ 
alog: lrtt£j_/_Aww ■ ast ro . rug. nl/~rave/, which is 
from IBreddels et akl (l2010t) . We use the RAVE stellar param¬ 


eters from this table as inputs to the distance code. Uncer¬ 
tainties on the stellar parameters were set to crxeff = 100 K, 
(Tfog g = 0.3 dex, and cr[pe/H] =0.2 dex to approximate the typi¬ 
cal errors in RAVE. The left panel of Eigure|4ls hows residuals 
from a comparison of our results to those from iBreddels et al] 
(l2010h . There is a sy stematic shift of ^ 26 % between our 
distances and those of IBreddels et alJ (l2010h . with scatter of 
23%. On close examination, there is no obvious correla¬ 
tion of the distance residuals with any of the input stellar pa¬ 
rameters (e.g., logg, [Ee/H]). Thus th e systematic of f set be- 
tween our distance scale and that of [Breddels et alJ (120101) 
may be due to differences in the isochrones used in the fit¬ 
ting. IBreddels et alJ (1201 Oh used Yale-Yonsei isochrones in a 
grid with 40 logarithmically spaced ages between 0.01 and 
15 Gyr. The differences between the Dartmouth and Yale- 
Yonsei iso chrones, and the heav y emphasis on very young 
ages in the IBreddels et al.l (1201 Oh grid, seem to cause system¬ 
atic shifts. Reassuringly, when we applied our code, which 
defines in the same way as IBreddels et al.l (I2010h . to these 
data, the scatter about the mean difference is small (but the 




















































Distances to Stars With LAMOST Stellar Parameters 


7 


systematic offset remains). 

The second s et of RAVE data on which we tested the code 
is the catalog of IZwitter et al.l (12 0101). The se authors improved 
upon the method of iBreddels et al.l (120101) by using a linearly 
spaced grid of ages, deriving distances separately using Yale- 
Yonsei, Padova, and Dartmouth isochrones, and by weight¬ 
ing stages of stellar evolution to account for the relative num¬ 
bers of stars of different masses. We ran our code on the pa¬ 
rameters of r oughly 16,000 stars 5800 giants and ^ 9900 
dwarfs) from IZwitter et al.l (1201 Ol) and compared directly to 
their Dartmouth results. The comparison is shown in the right 
panel of Figured with residuals calculated in the same way 
as for the Breddels et al. sample. Again, these residuals show 
a systematic shift such that our estimates are lower than the 
RAVE distances. The systematic difference between our re¬ 
sult a nd the RAVE distance s is smaller than for the compari¬ 
son to lBreddels et al.l (1201 Oh . as expected since the grid spac¬ 
ing in age is similar and we are using the same isochrone sets 
(Dartmouth). Our Bayesian method produces decidedly non- 
Gaussian residuals, with a median offset of ^ 12% and scatter 
of - 16%. 

As a final test of the accuracy of our derived distances, we 
generate catalogs u sing the Besan 9 on model of the Milky Way 
dRobin et al.ll2003h for two fields of view: {l^b) = (180°, 30°) 
and (l,b) = (180°,60°). The b = 60° catalog contains 5614 
stars (299 giants and 5315 dwarfs) with 3500 K< Teff < 
10000 K; the b = 30° field has 469 giants and 13384 dwarfs 
in the same temperature range. We again assign uncertain¬ 
ties to the stellar parameters of crTeff = 100 K, aiog g = 0.3 dex, 
and cr[Fe/H] = 0.2 dex. Eigure |5] shows the results of running 
our algorithm with the stellar parameters from the Besan 9 on 
model as inputs. We recover the distances well, with a mildly 
bimodal distribution and a slight tail extending to — 50% over¬ 
estimate of distances. These residuals have roughly zero me¬ 
dian offset, but it is clear that a large fraction of stars have dis¬ 
tances underestimated by — 5%. The positive “bump” in these 
residuals appears to consist mostly of the youngest nearby 
stars and old, metal-poor halo giants in the Besan 9 on model 
catalog. 


4. APPLICATION TO LAMOST DATA 

Having verified the effectiveness of our distance code on 
catalog and simulated data, we now apply the code to the ex¬ 
isting LAMOST data. As of this date, the LAMOST catalog 
(internal data releases 1 and 2) consists of — 1.8 million stars 
with stellar parameters (out of — 3.6 million that have been 
observed; stars with low S/N, cool M-type stars, and hot OBA 
stars do not have parameters from the LAMOST pipeline). In 
this section, we show some simple “sanity checks” to verify 
that the code is producing reasonable results, and to provide 
an idea of the scope of the LAMOST data set. 

4.1. LAMOST Stellar Parameters 

The LAMOST parameters for stars in the range 3500 K< 
Teff < 8000 K and with S/N> 5 in g and r bands have median 
uncertainties of — 160 K, — 0.5 dex, and — 0.3 dex, in Teff, 
logg, and [Le/H], respectively. We note that the [Le/H] un¬ 
certainties in the second full year of survey operations (2013 
September - 2014 June) are significantly smaller (median 
0.18 dex) than the earlier periods; the Teff and logg errors are 
similar in earlier and later data. It is unclear whether this is 
due to changes in the LAMOST data reduction pipeline, or 
improved data quality as the survey progresses. 



O 

■Q 


0.0 0.2 0.4 0.6 0.8 1.0 

fractional distance error 



Fig. 6 .— Fractional errors on the derived distances for LAMOST stars 
compared to the uncertainties on the stellar parameters Tgff, log^, and [Fe/H] 
(from top to bottom). The grayscale encodes the number of stai's in each bin 
on a logarithmic scale between 10 and 20,000. The distance errors are only 
weakly correlated with Tgff or [Fe/H] uncertainties. By far the strongest de¬ 
pendence is seen in the center panel, which shows a roughly linear correlation 
between surface gravity uncertainty and the error in the derived distance. 


The parameter that most strongly affects the derived dis¬ 
tance errors is surface gravity. This can be seen in Eigure |6] 
which compares the errors on Teff, logg, and [Le/H] with the 
error in derived distances based on those parameters. There 
is a slight correlation of crTeff and crj, but little dependence of 
distance errors on uncertainty in [Le/H]. The middle panel, 
showing (Tiog g vs. tJd, exhibits a roughly linear correlation be¬ 
tween the surface gravity uncertainty and the errors on the de¬ 
rived distances. Lor an uncertainty of — 0.5 dex in logg, Lig- 
ure|6]suggests that we can expect a — 25 - 35% distance error. 
It is thus vital that surface gravities fr om LAMOST spectra 
are determined as precisely as possible. iLiu. C.. et al.l (I2014bl) 
recently published a method to improve logg estimates for gi¬ 
ant stars in the Kepler field that have also been observed with 
LAMOST. Based on corrections from comparison to astero- 
seismic logg measurements from Kepler, Liu et al. obtain 
uncertainties in logg from LAMOST spectra of — 0.1 dex, 
which yields distance estimates with better than 10% preci- 
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^a-enhanced ^solar a)^^solar a 


Fig. 7.— Difference between distances to halo giants (|Z| > 3 kpc, 
-2.4 <[Fe/H]< -1.0, with S/N> 10 in g and r-band) measured with solai- 
scaled isochrones ([o/Fe] = 0.0; labeled “solar q”) and an a-enhanced 
([a/Fe] = 0.4) grid of isochrones. On average, the a-enhanced isochrones 
find distances ~ 13% smaller (derived via the dashed-line Gaussian fit shown 
above) than those from the solar-scaled grid. 

sion. Indeed, at a given temperature, AMabs ~ 2.5A(logg)0 
so if the uncertainty of logg is improved by 0.1 dex, the un¬ 
certainty in absolute magnitude improves by 0.25 mag, and 
the accuracy of the distance estimate improves by ~12%. 

4.2. Effect of a-element Abundances on Distances to 
Metal-poor Halo Giants 

As noted in Section 2, our algorithm tends to overesti¬ 
mate the distances to metal-poor halo giants in synthetic cat¬ 
alogs from the Besan 9 on model. It is well established that 
the metal-poor stellar populations of the Milky Way halo 
are typical ly enhanced in g -elements relative to disk popula¬ 
tions (e.g.. lVenn et al.ll2004h . with metal-poor ([Fe/H]< -1.0) 
halo stars typically having [a/Fe]ss 0.4. We now return to a 
subset of stars for which we have LAMOST stellar param¬ 
eters, and examine the effect of replacing the solar-scaled 
isochrones with a-enhanced versions in our distance code. 
To do so, we generate a new isochrone grid with [a/Fe] = 
h- 0.4, [Fe/H]< -1.0, and the same steps in age as the orig¬ 
inal isochrone set. We run our distance algorithm on a set 
of stellar parameters from 239,446 LAMOST spectra (com¬ 
prised of recent, third-year LAMOST spectra) with the a- 
enhanced isochrones. From the resulting distance catalog, we 
then select out only likely metal-poor halo stars with S/N> 10 
in g,r-bands, -2.4 <[Fe/H]< -1.0, logg < 3.5, and at least 
3 kpc from the Galactic plane. This produces a sample of 542 
likely halo stars. Figure [T] compares the distance from the a- 
enhanced grid to the distance from the original isochrone grid, 
in the sense (iia-enhanced-<isoiar a)/t^soiar a- We find that the 
[a/Fe] = h- 0.4 grid produces distances on average 13% nearer 
than those from the [a/Fe] = 0.0 grid. This likely explains the 
^ 20% systematic overestimation of distances for halo stars 
from the Besan 9 on catalogs. Because halo populations in 
the Besanco n model were oxy gen-enhanced relative to disk 
populations (iRobin et al.l 120031) . our assumption of solar a- 
abundances likely biases the derived distances. Adopting a 
more appropriate a-enhanced isochrone grid for metal-poor 
halo stars would remedy this situation. Indeed, one would ide- 

Because oc —2.51ogL, LocR^, andg oc Thus Mabs oc 2.51ogg. 



Number of repeat obs. 


Fig. 8 .— Number of repeat observations of the 214,514 unique stars with 
multiple spectra in the LAMOST database. The majority of these objects 
have fewer than 4 observations, but some have as many as 14 separate spectra. 

ally incorporate a measured [a/Fe] from the LAMOST spec¬ 
trum itself into the distance estimation for each star; we will 
include this in future upgrades to the distance code as abun¬ 
dance estimates become available for LAMOST stars. 

4.3. Internal LAMOST Checks on Repeat Observations 

Of the ^1.8 million stellar spectra in the LAMOST cata¬ 
log, 550,000 of them 30%) are stars with repeat obser¬ 
vations. There are 214,514 unique stars that have been ob¬ 
served multiple times and have sufficient quality spectra at 
each epoch to derive stellar parameters. An individual star 
may have as many as 14 observations, but most have 2-4 ob¬ 
servations; the distribution of the number of repeat measure¬ 
ments is shown in Figure 0 Figure |9] shows the standard de¬ 
viation of our distance measurements for stars with multiple 
observations. This is expressed as a fractional deviation of the 
mean measured distance, (Td/dmean, and plotted as a function 
of the minimum signal to noise of the measurements being 
compared. One would expect that the scatter in derived dis¬ 
tances would increase if one (or more) of the ^ectra has low 
S/N. This is precisely what is seen in Figure |^- the scatter 
is ^ 5% for spectra with minimum S/N> 20, and begins to 
rise for S/N below 20. However, even when the minimum 
S/N is as low as 2.5, the typical scatter in distances in only 
^ 20%. This verifies that (a) our code produces repeatable re¬ 
sults when applied to multiple observations of the same star, 
and (b) the LAMOST pipeline provides consistent estimates 
of stellar parameters from these multiple observations. 

4.4. Results from LAMOST Data 

After running our distance code on the entire catalog of 
LAMOST stellar parameters, we perform some checks to ver¬ 
ify that the results make sense, and to explore the utility of our 
distances for Galactic structure studies. Using our distances, 
we calculate Galactocentric Cartesian coordinates (assuming 
the Sun is at Rq = 8 kpc, with (A,T,Z)sun = (-8,0,0) kpc). 
The first test is to see whether the metallicity distribution as 
a function of height above the Galactic plane near the north 
Galactic cap is as expected. We select stars atb> 60°, keep¬ 
ing only those with S/N> 10 in the SDSS g-band. This yields 
189,106 stars. This sample should roughly probe the Galac¬ 
tic metallicity gradient with height; one expects that on aver¬ 
age the metallicity should be nearly solar close to the plane. 






















Distances to Stars With LAMOST Stellar Parameters 


9 



min. S/N 


Fig. 9.— Standard deviation aj of the distance estimate for the 214,514 
stars with multiple measurements in LAMOST. This is expressed as a frac¬ 
tional deviation of the distance, crj/tfmean, as a function of the minimum sig¬ 
nal to noise of the spectra included in the derivation of aj for each star. We 
calculated mean scatter (filled diamonds) and its standard deviation (error 
bars) for these results in bins of 2.5 in S/N. The scatter from repeat measure¬ 
ments is typically ~ 5% for high S/N stars (min. S/N> 20), then increases 
to ~ 20% at the low S/N end. This suggests that even for fairly poor quality 
spectra, our distance derivations (and thus the stellar parameters on which 
they are based) are robustly repeatable. 



Z (kpc) 

Fig. 10.— Metallicity from the LAMOST pipeline versus height above the 
plane for a sample of 189,106 stars aXb > 60° that have S/Nj >10. The Z 
coordinate is based on distances derived by our code using LAMOST stellar 
parameters. Contours contain (2, 5, 10, 25, 50, 100, 200, 300, 400, 500, 
750, 1000, 1500, 2500, 4000) stai's. As expected for disk stars, the mean 
metallicity falls from near solar just above the Galactic plane to ([Fe/H]} ^ 
-0.6 near Z ~ 1 kpc. This peak metallicity, which is typical of the Galactic 
thick disk, persists as far out as we probe, with a long tail to lower metallicity. 


and decrease with height as the thin disk transitions into the 
lower-metallicity thick disk. Indeed, this is exactly what is 
seen in a contour plot of these data in Figure [TO] The peak 
metallicity decreases from slightly subsolar at Z ^ 0.3 kpc to 
[Fe/H]~ -0.6 at Z ~ 1 kpc. Above this, the peak metallicity 
remains roughly the same, with a long tail to low metallicities 
representing predominantly local halo stars. 

Though giant stars in the Galactic halo represent a tiny frac¬ 
tion of the stars observed by LAMOST, we also hope to use 
them to explore structure (and substructure) in the halo. We 
thus wish to check whether our distances can be used to iso¬ 
late a relatively pure sample of Milky Way halo giants. To test 
this, we select stars with Galactocentric radii Rqq > 20 kpc 
that are also at heights |Zgc| > 5 kpc above/below the plane. 
Such a sample of stars should be predominantly halo stars. 



[Fe/H] 


Fig. 11.— Normalized metallicity distribution of 1,473,135 stars at Rqq < 
20 kpc and |Z| > 2.5 kpc (solid histogram). The 1,705 stars at |Z| > 5 kpc 
and Rqc > 20 kpc are represented by the dashed line. The latter should be 
mostly halo stars, and peaks at [Fe/H]^ —1.5 as expected for the Galactic 
halo, with few metal-rich ([Fe/H] > —1.0) stars. The |Z| < 2.5 kpc sample 
contains mostly metal-rich stars, as expected for predominantly disk popula¬ 
tions. The abrupt cutoff at [Fe/H]= —2.5 is due to the lower limit o f metallic¬ 
ities p roduced by the LAMOST pipeline rather than a real effect. ILee et all 
1201511 adapted the Sloan SSPP for more general usage; use of this pipeline 
on LAMOST spectra avoids the artificial cutoff at [Fe/H]= -2.5. 

We check this by plotting a metallicity histogram (dashed line 
in Figure [nil for the 1528 stars selected in this way. These 
stars peak at a metallicity around [Fe/H] ^ -1.5, as expected 
for inner-halo stars, with very few metal-rich stars. In con¬ 
trast, a sample selected to be inside Rqq < 20 kpc and near 
the disk (|Zgc| < 2.5 kpc; solid line in Figure flTT) contains 
mostly metal-rich stars with disk-like [Fe/H]. 

Note that neither of these sanity checks showing metallic¬ 
ity distributions for different Galactic populations (Figures [TO] 
and fTTT) represents the true Galactic metallicity distribution for 
these populations. To derive the intrinsic distribution would 
require correcting the selection effects present in LAMOST 
data. These figures are simply meant to illustrate that stellar 
samples selected using our derived distances have properties 
similar to what one might expect based on our knowledge of 
metallicity distributions of Milky Way components. 

Finally, we search the LAMOST database for open 
cluster member stars. We begin with the compi¬ 
lation of known Galactic open clusters av ailable at 
http : / /WWW. astro . lag. usp . br/ocdb/ (iDias et al.l 
120021) . For each cluster in this list, we initially selected all 
stars from LAMOST within the published cluster diameter 
that also have LAMOST RV within 20 km s“' of the published 
value (note that we only used clusters with known RVs for 
this exercise). After this initial cut, we examined histograms 
of velocities, distances, and metallicities for each of the clus¬ 
ters with more than 15 candidates. For clusters with obvious 
distance and velocity peaks, we manually select stars within 
^ 10 km s“' of the peak value, and fit a Gaussian to the dis¬ 
tance distribution of these cluster candidates. Clusters with 
obvious signatures were NGC 1039, NGC 1662, NGC 2168, 
NGC 2281, NGC 2548, ASCC 26, and NGC 1647. The num¬ 
ber of candidates selected ranged from 19 to 102 stars, with 
the nearest clusters having the most candidates. Figure [12] 
compares our measured distances (from the Ga ussian fits) 
for these seven clusters, £/lamost, to those from iDias et alJ 
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(iiit. Error bars on these points represent the Gaussian 
a of the stars included. The dashed line corresponds to one- 
to-one agreement between our measurements and literature 
values. All but one of the seven clusters’ distances is consis¬ 
tent with values from the literature. Thus, we have conhrmed 
the effectiveness of our distance estimations. This simple ex¬ 
ercise highlights the potential of LAMOST to amass a sample 
of open cluster stars with homogeneously measured metallic- 
ities, velocities, and distances that can be used to probe the 
Galactic disk in exquisite detail. 

5. CONCLUSIONS 

We present a method to derive distances to stars with mea¬ 
sured stellar parameters (Teg, logg, and [Fe/H]). This was de¬ 
veloped with particular interest in deriving distances to the 
many millions of stars that will be observed by the LAMOST 
survey, in order to enable studies of Galactic structure with 
this vast data set. The code is based on a Bayesian method 
that evaluates the posterior PDF in absolute magnitude for 
a given star, estimated via comparison to a grid of theoreti¬ 
cal isochrones. The PDF incorporates information about Teff, 
logg, and [Fe/H], along with their uncertainties. To account 
for selection effects, we take advantage of the fact that each 
LAMOST plate typically observes ^ 3000 stars simultane¬ 
ously. The observed distribution of logg for stars within 0.25 
magnitudes in color-magnitude space is mapped onto theoret¬ 
ical isochrones to derive a proxy “luminosity function” ex¬ 
pected for stars in that region of sky at the given color and 
magnitude. This accounts simultaneously for the selection 
function through which stars were chosen for LAMOST ob¬ 
servation, and the variation in stellar populations with Galac¬ 
tic line of sight (and distance). A flat age prior is imple¬ 
mented, since we have no information about the ages of in¬ 
dividual stars. This could, in principle, be modified to ac¬ 
count for the relatively well-known age-metallicity relation 
in the Milky Way, but we choose to leave it flat so as not to 
bias studies of Galactic stellar populations based on LAM¬ 
OST data. Likewise, we do not impose any priors related to 
Milky Way stellar populations; since we wish to study Galac¬ 
tic structure, we prefer to avoid introducing assumptions into 
our distance calculations. 

We test our code by measuring distances to samples of stars 
from Hipparcos and RAVE that have known stellar param¬ 
eters. Our distances agree with the parallax distances from 
Hipparcos, with roughly 17% scatter in the residuals. We 
And a 12% s ystematic shift betw een our distances and those 
measured bv Iz witter et al.l (1201 Ol) from the same RAVE sam¬ 
ple, with only 16% scatter, but with a large tail toward un¬ 
derestimate of the distance. We also test our code on simu¬ 
lated data along two lines of sight from the Besan 9 on model, 
and And that we recover the model distances with no net 
offset. There is, however, an apparent ^ 20% overestimate 
of distances to distant, metal-poor halo stars by our code. 
The source of this systematic shift is unclear, but it may be 
due to the fact that we have used isochrones with solar a- 
element abundances. Stars in the Milky Way halo are, on 
average, old an d g-enhanced rela tive to the Solar neighbor¬ 
hood (see, e.g.. IVenn et al.ll2004h . Because the RGB of an 
isochrone is shifted slightly to fainter absolute magnitudes 
when [a/Fe] is increased (at Axed [Fe/H] and age), the use 
of Solar-scaled isochrones will tend to bias results for a-rich 
stars toward overestimation of distances. We confirmed this 
impression by re-running the distance derivation for a sub¬ 
set of metal-poor halo giants, using an a-enhanced isochrone 
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Fig. 12.— Comparison of o ur derived distances (<^lamost) to those from 
the literature <Dias et al.ll2QQ2L dy^x) for seven open clusters found in LAM¬ 
OST. Points represent the central value of the best-fitting Gaussian for each 
cluster’s distance distribution, and eiTor bars sho\v the Gaussian <j. The 
dashed line represents one-to-one agreement. All but one of the clusters 
agrees very closely with the known distance. 


grid ([a/Fe]= -1-0.4). Indeed, this exercise shifted the distance 
estimates for these stars by an average of 13% closer than 
the estimates based on the Solar-alpha grid. Thus we suggest 
that adopting a more appropriate a-enhanced isochrone set for 
metal-poor halo stars would remedy the overestimation of dis¬ 
tances to these stars, and that ultimately the measured [a/Fe] 
should be incorporated into the distance estimation process. 

Finally, we present some results based on LAMOST data. 
A sample of ~ 189,000 stars near the north Galactic cap 
shows expected trends in [Fe/H] with height (Z) above the 
Galactic plane. We also show that a sample selected to be 
distant halo stars based on our derived distances consists 
solely of metal-poor stars with a distribution peaked around 
[Fe/H]~ -1.5, as expected for the Galactic inner halo. Some 
studies of kinematics of nearby stars based on LAMOST data 
and with distances from this code have alre ady appeared in the 
literature (iTian et al.l2015tlXia et al.ll2015h . and we anticipate 
that distances derived by this method will prove useful for nu¬ 
merous upcoming studies of Galactic structure. Furthermore, 
as the LAMOST data reduction pipelines continue to improve, 
we anticipate that uncertainties on stellar parameters derived 
from the spectra will become smaller, which will in turn im¬ 
prove our estimates of the distances. Eventually, parallaxes 
that will come from the Gaia mission (iPerrvman et al.ll2001h 
will likely supersede these distance measurements for the ma¬ 
jority of the stars observed by LAMOST, as part of a vast sam¬ 
ple of direct distance measurements throughout the Galaxy. 
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